An Analysis of Statistical Models and Features for Reading Difficulty Prediction
نویسندگان
چکیده
A reading difficulty measure can be described as a function or model that maps a text to a numerical value corresponding to a difficulty or grade level. We describe a measure of readability that uses a combination of lexical features and grammatical features that are derived from subtrees of syntactic parses. We also tested statistical models for nominal, ordinal, and interval scales of measurement. The results indicate that a model for ordinal regression, such as the proportional odds model, using a combination of grammatical and lexical features is most effective at predicting reading difficulty.
منابع مشابه
Stock Market Modeling Using Artificial Neural Network and Comparison with Classical Linear Models
Stock market plays an important role in the world economy. Stock market customers are interested in predicting the stock market general index price, since their income depends on this financial factor; Therefore, a reliable forecast in stock market can be extremely profitable for stockholders. Stock market prediction for financial markets has been one of the main challenges in forecasting finan...
متن کاملSparse Structured Principal Component Analysis and Model Learning for Classification and Quality Detection of Rice Grains
In scientific and commercial fields associated with modern agriculture, the categorization of different rice types and determination of its quality is very important. Various image processing algorithms are applied in recent years to detect different agricultural products. The problem of rice classification and quality detection in this paper is presented based on model learning concepts includ...
متن کاملPersonality Traits and Multiple Intelligences as Predictors of Reading Proficiency among Iranian EFL learners
The present study investigated the relationship between personality traits and multiple intelligences, and learners’ reading proficiency. To this end, 384 graduate EFL students participated in the present study. Two questionnaires, namely the NEO personality inventory-revised, and McKenzie’s (1999) MI inventory as well as a sample TOFEL reading comprehension test were used to collect the data. ...
متن کاملInvestigating Hortatory Force In The EFL Reading Passages
This study investigates some the reading passages in terms of hortatory messages. To this end a methodology based on Critical Discourse Analysis was adopted. The reading texts from ELT textbooks were examined through a model which drew on Fairclough's approach to CDA, specifically (Fairclough, 2003) in which three characteristic features of hortatory texts are introduced. The analysis reveals t...
متن کاملPrediction of the waste stabilization pond performance using linear multiple regression and multi-layer perceptron neural network: a case study of Birjand, Iran
Background: Data mining (DM) is an approach used in extracting valuable information from environmental processes. This research depicts a DM approach used in extracting some information from influent and effluent wastewater characteristic data of a waste stabilization pond (WSP) in Birjand, a city in Eastern Iran. Methods: Multiple regression (MR) and neural network (NN) models were examined u...
متن کامل